Experimental: Unified LSP server in rewatch by nojaf · Pull Request #8243 · rescript-lang/rescript

nojaf · 2026-02-09T12:04:15Z

This branch explores embedding a full LSP server directly into the rescript binary (rescript lsp), replacing the current architecture where a Node.js extension mediates between the editor and separate build/analysis processes.

The core idea

Today, the ReScript editor experience involves three processes: a Node.js VS Code extension, the rescript build watcher, and the rescript-editor-analysis.exe binary. They communicate through files on disk — the editor extension launches builds, waits for artifacts, then shells out to the analysis binary for each request.

This branch collapses the build system and LSP server into a single Rust process using tower-lsp. The build state lives in memory, and analysis requests shell out to the same rescript-editor-analysis.exe but with source code passed via stdin instead of being read from disk.

No temp files — stdin everywhere

Both bsc and the analysis binary receive source code via stdin rather than through temporary files. For didChange (unsaved edits), bsc -bs-read-stdin produces diagnostics without writing anything to disk. For analysis requests (hover, completion, code actions, etc.), the analysis binary receives a JSON blob on stdin containing the source text, cursor position, and package metadata. The OCaml analysis code was refactored with FromSource variants that parse from a string rather than opening files — so everything works correctly on unsaved editor buffers.

Separate build profile: `lib/lsp`

The LSP server writes its build artifacts to lib/lsp/ instead of lib/bs/. This means it doesn't conflict with rescript build or rescript build -w running in a terminal — both can operate independently on the same project without stepping on each other's artifacts.

Initial build: typecheck only

On initialized, the server runs a full build but only goes as far as producing .cmt/.cmi files (the TypecheckOnly profile). It deliberately skips JS emission. This gets the editor operational as fast as possible — type information for hover, completion, go-to-definition etc. is all available, without paying the cost of generating JavaScript for every module upfront.

Smart incremental builds on save

When a file is saved, the server runs a two-phase incremental build:

Emit JS for the dependency closure — the server computes the transitive imports of the saved file and only emits JavaScript for that file and its dependencies. Modules outside this closure are skipped entirely. So saving a module produces JS for it and any imports that haven't been compiled yet — not the entire project.
Typecheck reverse dependencies — modules that transitively depend on the saved file are re-typechecked to surface errors caused by API changes (e.g. a removed export). This gives you project-wide diagnostics on save — if you rename a function, you immediately see errors in every file that uses it, even files you don't have open. No JS is emitted for these — they get their JS when they are themselves saved.

What's implemented

All standard analysis endpoints are wired up: completion (with resolve), hover, signature help, go to definition, type definition, references, rename (with prepare), document symbols, code lens, inlay hints, semantic tokens, code actions, and formatting.

Observability

Every LSP request and build operation is traced with OpenTelemetry spans, viewable in Jaeger. This makes it straightforward to profile request latency and understand what the server is doing.

Test infrastructure

Each endpoint has integration tests using vscode-languageserver-protocol that boot a real LSP server in a sandbox, send requests, and snapshot both the results and the OTEL trace structure.

What's not here yet

workspace/didChangeWatchedFiles — handling external file changes (git checkout, etc.)
Multi-workspace / monorepo support
createInterface and openCompiled custom commands

This is an experiment to validate the architecture. If it proves useful, individual pieces can be split into focused PRs.

pkg-pr-new · 2026-02-09T17:38:30Z

Open in StackBlitz

rescript

npm i https://pkg.pr.new/rescript@8243

@rescript/darwin-arm64

npm i https://pkg.pr.new/@rescript/darwin-arm64@8243

@rescript/darwin-x64

npm i https://pkg.pr.new/@rescript/darwin-x64@8243

@rescript/linux-arm64

npm i https://pkg.pr.new/@rescript/linux-arm64@8243

@rescript/linux-x64

npm i https://pkg.pr.new/@rescript/linux-x64@8243

@rescript/runtime

npm i https://pkg.pr.new/@rescript/runtime@8243

@rescript/win32-x64

npm i https://pkg.pr.new/@rescript/win32-x64@8243

commit: 9f6f81f

nojaf · 2026-02-25T13:18:13Z

A bit of an update on this PR:

I'm currently working on a new side project in ReScript where an AI/LLM/Claude drives all things coding. I boss it around and it writes the ReScript code for me. While scratching that itch, I'm using the LSP server from this PR in Zed:

.zed/settings.json:

{
  "lsp": {
    "rescript-language-server": {
      "binary": {
        "path": "/Users/nojaf/.bun/bin/bun",
        "arguments": [
          "--bun",
          "/Users/nojaf/Projects/rescript/cli/rescript.js",
          "lsp",
        ],
        "env": {
          "OTEL_EXPORTER_OTLP_ENDPOINT": "http://localhost:4707",
        },
      },
      "initialization_options": {
        "queue_debounce_ms": 50,
        "diagnostics_http": 12307,
      },
    },
  },
}

The nice thing about this LSP server is that it has a diagnostics endpoint the LLM can call. The LLM calls this after making edits and has a way to clean up after itself. I wish this was more of an industry standard, but since I use ACP with Claude/Zed, I lack support for this (please upvote).

This way of working also revealed a lot of use cases to consider. LLMs will make frequent file edits and update files in a certain order, for example creating an API change in one file and updating other files a few steps later. LLMs also tend to delete and create files. These patterns can be tricky to handle.

Another great UX/DX thing in the PR is that saved files compile to JS. In practice I start vite in a shell and don't think about it anymore. The LLM makes changes and the whole thing just updates accordingly. Very productive with React Fast Refresh.

When the LSP doesn't work, I have the LLM report a problem to another endpoint on the internal HTTP server. That puts a marker in the OTel trace showing something unusual happened. I built a small custom OTel debug tool that digests all telemetry data, saves it to a local SQLite database, and exposes it in a useful way. Another LLM can then investigate when a llm.report span id is passed. In essence you say: "hey look at the trace, weird stuff happened here" and it will investigate what happened based on the trace.

This is a very effective way to troubleshoot, but I still find genuine gaps on a weekly basis. The LSP has a lot of new scenarios Rewatch never had to account for.

In conclusion, I'm still experimenting and learning a lot with this PR. I can't say where this will end or what I'll do with it afterward. This is also why I haven't circled back to #8241. The test infra has always been a carved-out part of that PR, but overall I'm not sure I want to keep it as a separate thing. Things are still too much in flux right now.

Having a lot of fun though!

…g#8291) * Fix rewatch panic when package.json has no "name" field * CHANGELOG

The modulePath for child modules used the child's own name instead of the parent's name. Since qualifiedName prepends structure.name, this caused the child name to appear twice (e.g. "Impl.Impl" instead of "Event-WebAPI.Impl"), making every module with an identically-named sub-module collide on the UNIQUE constraint in rescript.db.

Project-wide symbol search via workspace/symbol. The Rust LSP collects .ast/.iast paths from the build state (filtering by CompilationStage) and shells out to a new analysis binary subcommand (rewatch workspaceSymbol) that deserializes the ASTs and walks them with Ast_iterator to find matching symbols. Also documents rescript sync / rescript.db in LSP.md.

Three issues prevented find-references from working across packages: - build_file_sets only included root package modules in projectFiles, so sibling workspace packages were never searched. Now uses is_root || is_local_dep to include all local packages. - Cmt.fullsFromModule called fullFromUri which resolves packages via global Packages.state — never populated in the rewatch code path. Now uses the passed package directly, matching loadFullCmtWithPackage. - cmtPosToPosition could produce negative line/character values from compiler locations with pos_cnum < pos_bol, causing JSON deserialization failures that silently dropped entire result sets. Now clamps to 0.

Track cross-module references in rescript.db so AI agents can answer "which parts of dependency X does this project use?" without grep. OCaml side (LlmIndex.ml): - Extract usages from .cmt externalReferences with namespace resolution - For modules with .resi, load .cmti for structure but .cmt for usages Rust side (llm_index.rs): - Add usages table schema, insert_usages two-pass insertion - Rename file_hash → cmt_hash (hash the typed tree, not the interface) - Use unchecked_transaction for RAII rollback safety in run_sync - Typed deserialization (AnalysisModule) replaces raw serde_json::Value Incremental sync (lsp/queue/db_sync.rs): - Background DbSyncQueue with debounce and per-project coalescing - cmt_hash comparison skips unchanged modules (avoids analysis binary) - OTEL instrumentation with parent span propagation into spawn_blocking LSP wiring (lsp/queue.rs, lsp.rs): - trigger_initial_db_sync after first build populates usages immediately - build_sync_event with absolute path matching for touched_files filter - ensure_runtime_module_data so sync works before first hover/completion - file_build::run now returns touched_files for downstream sync

CompilationStage::SourceDirty was a lossy representation — the build system knew which file changed (.res vs .resi) at entry points like mark_file_parse_dirty, but discarded that information into a single undifferentiated variant. Split into three variants: - SourceImplementationDirty: only .res changed - SourceInterfaceDirty: only .resi changed - SourceBothDirty: both changed, new module, deleted dep, or unknown mark_file_parse_dirty now captures the is_interface flag from find_module_for_file and sets the precise variant, with upgrade logic when both sides become dirty. This is Phase 1 (domain modeling). The parse and compile loops still process both sides unconditionally. Phase 2 will use the split to skip redundant dependent typechecks when .cmi cannot have changed.

When a module has a .resi file and only the .res was saved (SourceImplementationDirty), the .cmi is derived exclusively from the .resi and cannot change. In this case, typecheck_dependents is skipped entirely since no dependent can have new type errors. This avoids re-typechecking the entire reverse-dependency closure on every implementation-only save for modules with interfaces.

Extend the .cmi stability check to also cover modules without .resi files. Snapshots .cmi hashes before compilation and compares after — if the hash didn't change (e.g. body-only edit), typecheck_dependents is skipped for that module too. Extract snapshot_cmi_state and modules_with_cmi_changed as helper functions to keep build_batch readable as a high-level flow.

Previously the LSP's background db_sync queue only updated the usages table after each build. Types, values, fields, constructors, aliases, and nested modules remained stale until a manual `rescript sync`. Now process_batch refreshes all module child data by deleting old rows and re-inserting from analysis output. Top-level module rows are preserved to keep foreign key references intact. New modules not yet in the DB are fully inserted. Enables PRAGMA foreign_keys=ON for cascading nested module deletes. Adds polling-based integration tests that verify values and types in rescript.db are updated after an LSP save.

The LSP's db_sync queue now creates rescript.db automatically on the first sync event instead of requiring `rescript sync` to be run first. Runtime modules (Stdlib_Array, Pervasives, etc.) are also sent to the analysis binary so they appear in the modules and usages tables. The LLM index usage extraction now resolves re-exporting modules like Stdlib to their leaf modules (e.g. "Stdlib" + ["Array", "find"] becomes "Stdlib_Array" + ["find"]), making the usages table consistent across runtime and namespaced packages. DB creation logic is shared between `rescript sync` (CLI) and the LSP via `llm_index::create_db`, which also cleans up WAL/SHM sidecar files to prevent disk I/O errors when recreating a DB that was opened by another process.

…e span

Add PRAGMA busy_timeout to test DB connections so readers wait instead of failing immediately when db_sync holds a write lock. Catch transient errors in waitForDb polling loop to retry on "database is locked". Increase test timeouts to 120s for db-sync-initial, db-sync-incremental, and workspace-symbol tests — on Windows CI the background db_sync analysis binary delays OTEL span export past the 60s budget.

Extract ld_mutable from the typedtree in the analysis binary, emit it in the llmIndex JSON output, and store it as an INTEGER column in the fields table.

Circular dependency errors were silently dropped because parse_compiler_output() only recognises bsc output formats. Now the compile loop creates BscDiagnostic entries directly for each file in a detected cycle. Also adds a fallback: when compile_errors is non-empty but produces no parsed or raw diagnostics, a generic error diagnostic is emitted to the affected files so errors never fail silently in the LSP.

nojaf mentioned this pull request Feb 9, 2026

Daemon architecture for rewatch #8231

Closed

nojaf added 5 commits March 16, 2026 10:11

Rust: rewatch LSP server, build pipeline, telemetry, llm_index

f55ab2f

OCaml: analysis commands, completions, hints, llm_index, syntax driver

cd00b98

Tests: replace old shell tests with vitest suite for rewatch

685ec8e

Otel viewer: Python/JS web app for OpenTelemetry trace viewing

7765e33

Config: CI, docs, Makefile, package.json, yarn.lock

32d3c52

nojaf force-pushed the rewatch-lsp branch from 0f179b8 to 32d3c52 Compare March 16, 2026 09:15

nojaf and others added 21 commits March 16, 2026 10:17

Update lock file

14db412

Use node:sqlite

fb5bb7b

Fix rewatch panic when package.json has no "name" field (rescript-lan…

61006e3

…g#8291) * Fix rewatch panic when package.json has no "name" field * CHANGELOG

Bump node version in integration test

bdffbeb

Add more LSP read http endpoints for the LLM

10c82c8

More agents.md and links

93950f2

fmt

7f4d8da

fmt

38d56cd

Add has_interface column to modules table and kind field to parse_fil…

89466aa

…e span

Avoid db race in lsp tests

f0af7c4

More agents.md link and otel viewer health point

ad7de5f

nojaf added 5 commits March 20, 2026 09:40

Add mutable column to fields table in rescript.db

fcbebad

Extract ld_mutable from the typedtree in the analysis binary, emit it in the llmIndex JSON output, and store it as an INTEGER column in the fields table.

Update snapshot

659c332

Put sqlite db behind a flag.

91001ab

Sort some names

9f6f81f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experimental: Unified LSP server in rewatch#8243

Experimental: Unified LSP server in rewatch#8243
nojaf wants to merge 31 commits intorescript-lang:masterfrom
nojaf:rewatch-lsp

nojaf commented Feb 9, 2026

Uh oh!

pkg-pr-new bot commented Feb 9, 2026 •

edited

Loading

Uh oh!

nojaf commented Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nojaf commented Feb 9, 2026

The core idea

No temp files — stdin everywhere

Separate build profile: lib/lsp

Initial build: typecheck only

Smart incremental builds on save

What's implemented

Observability

Test infrastructure

What's not here yet

Uh oh!

pkg-pr-new bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nojaf commented Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Separate build profile: `lib/lsp`

pkg-pr-new bot commented Feb 9, 2026 •

edited

Loading